Applying Rule Ensembles to the Search for Super-Symmetry at the Large Hadron Collider
In this note we give an example application of a recently presented
predictive learning method called Rule Ensembles. The application we present is
the search for super-symmetric particles at the Large Hadron Collider. In
particular, we consider the problem of separating the background coming from
top quark production from the signal of super-symmetric particles. The method
is based on an expansion of base learners, each learner being a rule, i.e. a
combination of cuts in the variable space describing signal and background.
These rules are generated from an ensemble of decision trees. One of the
results of the method is a set of rules (cuts) ordered according to their
importance, which provides a useful tool for diagnosing the model. We also
compare the method to a number of other multivariate methods, in particular
Artificial Neural Networks, the likelihood method and the recently presented
boosted decision tree method. We find better performance of Rule Ensembles in
all cases. For example, for a given significance, the amount of data needed to
claim SUSY discovery could be reduced by 15% using Rule Ensembles as compared
to using a likelihood method.
Comment: 24 pages, 7 figures, replaced to match version accepted for publication in JHE
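The recipe the abstract describes (rules as conjunctions of cuts harvested from an ensemble of decision trees, then combined in a linear model whose coefficients rank the rules by importance) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation; the synthetic dataset, the gradient-boosted tree ensemble, and the L1-penalised combiner are all stand-in choices.

```python
# Illustrative rule-ensemble sketch: harvest rules (conjunctions of cuts)
# from an ensemble of decision trees, then rank them with a sparse linear fit.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=400, n_features=6, random_state=0)

# Step 1: grow an ensemble of shallow trees; each root-to-leaf path is a rule.
gb = GradientBoostingClassifier(n_estimators=20, max_depth=3, random_state=0)
gb.fit(X, y)

def extract_rules(tree):
    """Return each leaf's rule as a list of (feature, threshold, is_leq) cuts."""
    t = tree.tree_
    rules = []
    def walk(node, cuts):
        if t.children_left[node] == -1:          # leaf: the path so far is a rule
            rules.append(cuts)
            return
        f, thr = t.feature[node], t.threshold[node]
        walk(t.children_left[node], cuts + [(f, thr, True)])
        walk(t.children_right[node], cuts + [(f, thr, False)])
    walk(0, [])
    return rules

rules = [r for est in gb.estimators_.ravel() for r in extract_rules(est)]

def rule_matrix(X, rules):
    """Binary feature per rule: 1 iff a sample satisfies every cut of the rule."""
    R = np.ones((X.shape[0], len(rules)), dtype=float)
    for j, cuts in enumerate(rules):
        for f, thr, leq in cuts:
            R[:, j] *= (X[:, f] <= thr) if leq else (X[:, f] > thr)
    return R

# Step 2: an L1-penalised linear model on the rule features; the surviving
# coefficients order the rules (cuts) by importance, as in the abstract.
lin = LogisticRegression(penalty="l1", solver="liblinear", C=0.1)
lin.fit(rule_matrix(X, rules), y)
importance = np.abs(lin.coef_).ravel()
top = np.argsort(importance)[::-1][:5]           # five most important rules
```

The ordered rule list is what makes the model diagnosable: each top-ranked rule reads directly as a set of cuts in the variable space.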
Reproducing Kernels of Generalized Sobolev Spaces via a Green Function Approach with Distributional Operators
In this paper we introduce a generalized Sobolev space by defining a
semi-inner product formulated in terms of a vector distributional operator P
consisting of finitely or countably many distributional operators P_n, which
are defined on the dual space of the Schwartz space. The types of operators we
consider include not only differential operators, but also more general
distributional operators such as pseudo-differential operators. We deduce that
a certain appropriate full-space Green function G with respect to P now
becomes a conditionally positive definite function. In order to support this
claim we ensure that the distributional adjoint operator P* of P is
well-defined in the distributional sense. Under sufficient conditions, the
native space (reproducing-kernel Hilbert space) associated with the Green
function can be isometrically embedded into, or even be isometrically
equivalent to, a generalized Sobolev space. As an application, we take linear
combinations of translates of the Green function, with possibly added
polynomial terms, and construct a multivariate minimum-norm interpolant to
data values sampled from an unknown generalized Sobolev function at data sites
located in some set X. We provide several examples, such as Mat\'ern kernels
and Gaussian kernels, that illustrate how many reproducing-kernel Hilbert
spaces of well-known reproducing kernels are isometrically equivalent to a
generalized Sobolev space. These examples further illustrate how we can
rescale the Sobolev spaces by the vector distributional operator P.
Introducing the notion of scale as part of the definition of a generalized
Sobolev space may help us to choose the "best" kernel function for
kernel-based approximation methods.
Comment: Updated version of the article published in Numer. Math., close to Qi Ye's Ph.D.
thesis (\url{http://mypages.iit.edu/~qye3/PhdThesis-2012-AMS-QiYe-IIT.pdf})
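Numerically, the minimum-norm interpolant the abstract describes is a linear combination of kernel translates whose coefficients solve a symmetric linear system. The sketch below uses a Gaussian kernel on a 1-D toy function; the kernel, shape parameter, and test function are illustrative choices, not the paper's examples.

```python
# Minimal sketch of a minimum-norm kernel interpolant:
#   s(x) = sum_j c_j K(x, x_j),  with coefficients from  K c = y.
import numpy as np

def gaussian_kernel(x, z, eps=8.0):
    """Gaussian kernel K(x, z) = exp(-(eps * |x - z|)^2) on 1-D points."""
    d = x[:, None] - z[None, :]
    return np.exp(-(eps * d) ** 2)

# Data sites and samples of a (here, known) target function.
sites = np.linspace(0.0, 1.0, 15)
f = lambda x: np.sin(2 * np.pi * x)
y = f(sites)

# Solve the symmetric positive definite interpolation system K c = y.
K = gaussian_kernel(sites, sites)
c = np.linalg.solve(K, y)

def interpolant(x):
    return gaussian_kernel(x, sites) @ c

# The interpolant matches the data exactly at the sites and approximates
# the underlying function in between.
grid = np.linspace(0.0, 1.0, 200)
max_err = np.max(np.abs(interpolant(grid) - f(grid)))
```

Swapping `gaussian_kernel` for a Mat\'ern kernel changes only the kernel routine; the interpolation system and the minimum-norm property are unchanged.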
Dimensionality reduction and prediction of the protein macromolecule dissolution profile
A suitable regression model for predicting the dissolution profile of poly(lactic-co-glycolic acid) (PLGA) micro- and nanoparticles can play a significant role in pharmaceutical/medical applications. The rate of dissolution of proteins is influenced by several factors; taking all such influencing factors into account, we have a dataset with three hundred input features. Therefore, a primary step before identifying a regression model is to reduce the dimensionality of the dataset at hand. On the one hand, we have adopted backward-elimination feature selection techniques for an exhaustive analysis of the predictability of each combination of features. On the other hand, several linear and non-linear feature extraction methods are used to extract a new set of features from the available dataset. A comprehensive experimental analysis of the selection or extraction of features and the identification of the corresponding prediction model is offered. The designed experiment and prediction models offer substantially better performance than the prediction models proposed earlier in the literature for the said problem.
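A greedy variant of backward elimination can be sketched as follows: repeatedly drop the feature whose removal least hurts a cross-validated score until a target dimensionality is reached. This is an illustrative sketch only; the abstract's analysis is exhaustive over feature combinations, and the model, scorer, and synthetic data below are stand-ins for the PLGA dataset.

```python
# Greedy backward-elimination sketch (illustrative, not the paper's pipeline).
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

X, y = make_regression(n_samples=120, n_features=10, n_informative=4,
                       noise=5.0, random_state=0)

def backward_eliminate(X, y, n_keep):
    kept = list(range(X.shape[1]))
    while len(kept) > n_keep:
        # Score the model with each remaining feature left out in turn.
        scores = []
        for f in kept:
            cols = [c for c in kept if c != f]
            scores.append(cross_val_score(Ridge(), X[:, cols], y, cv=5).mean())
        # Drop the feature whose removal left the best (highest) score.
        kept.pop(int(np.argmax(scores)))
    return kept

selected = backward_eliminate(X, y, n_keep=4)
```

For three hundred features this greedy loop costs O(d^2) model fits, which is the practical reason exhaustive subset analysis is paired with feature extraction in the abstract.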
Isometric Sliced Inverse Regression for Nonlinear Manifolds Learning
Sliced inverse regression (SIR) was developed to find effective linear dimension-reduction directions for exploring the intrinsic structure of high-dimensional data. In this study, we present isometric SIR for nonlinear dimension reduction, a hybrid of the SIR method with geodesic distance approximation. First, the proposed method computes the isometric distance between data points; the resulting distance matrix is then sliced according to K-means clustering results, and the classical SIR algorithm is applied. We show that isometric SIR (ISOSIR) can reveal the geometric structure of a nonlinear manifold dataset (e.g., the Swiss roll). We report and discuss this novel method in comparison to several existing dimension-reduction techniques for data visualization and classification problems. The results show that ISOSIR is a promising nonlinear feature extractor for classification applications.
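The three-stage pipeline in the abstract (geodesic distances, K-means slicing, classical SIR eigen-step) can be sketched as below. The neighbourhood size, number of slices, and the toy Swiss-roll data are illustrative assumptions, not the paper's settings.

```python
# Illustrative ISOSIR-style sketch: geodesic distances -> K-means slices ->
# classical SIR (between-slice covariance, generalized eigenproblem).
import numpy as np
from scipy.linalg import eigh
from scipy.sparse.csgraph import shortest_path
from sklearn.cluster import KMeans
from sklearn.neighbors import kneighbors_graph

rng = np.random.RandomState(0)

# Toy Swiss-roll-like data in 3-D.
t = 1.5 * np.pi * (1 + 2 * rng.rand(300))
X = np.column_stack([t * np.cos(t), 10 * rng.rand(300), t * np.sin(t)])

# Step 1: isometric (geodesic) distances via shortest paths on a k-NN graph.
graph = kneighbors_graph(X, n_neighbors=10, mode="distance")
D = shortest_path(graph, method="D", directed=False)

# Step 2: slice the data by K-means on the geodesic distance representation.
slices = KMeans(n_clusters=8, n_init=10, random_state=0).fit_predict(D)

# Step 3: classical SIR on the slices: between-slice covariance of the
# centered data, then a generalized eigenproblem against the covariance.
Xc = X - X.mean(axis=0)
Sigma = np.cov(Xc, rowvar=False)
M = np.zeros((X.shape[1], X.shape[1]))
for s in np.unique(slices):
    w = np.mean(slices == s)                 # slice proportion
    m = Xc[slices == s].mean(axis=0)         # slice mean
    M += w * np.outer(m, m)
evals, evecs = eigh(M, Sigma)                # ascending eigenvalues
directions = evecs[:, ::-1]                  # leading e.d.r. directions first
```

Replacing Euclidean slicing of a response with K-means on geodesic distances is what lets the linear SIR eigen-step pick up the manifold's nonlinear structure.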